Analyzing Linked Data Quality with LiQuate
نویسندگان
چکیده
The number of datasets in the Linking Open Data (LOD) cloud as well as LOD-based applications have exploded in the last years. However, because of data source heterogeneity, published data may suffer of redundancy, inconsistencies, or may be incomplete; thus, results generated by LOD-based applications may be imprecise, ambiguous, or unreliable. We demonstrate the capabilities of LiQuate (Linked Data Quality Assessment), a tool that relies on Bayesian Networks to analyze the quality of data and links in the LOD cloud.
منابع مشابه
Generating Possible Interpretations for Statistics from Linked Open Data
Statistics are very present in our daily lives. Every day, new statistics are published, showing the perceived quality of living in different cities, the corruption index of different countries, and so on. Interpreting those statistics, on the other hand, is a difficult task. Often, statistics collect only very few attributes, and it is difficult to come up with hypotheses that explain, e.g., w...
متن کاملAnalyzing Efficiency of Railway Transportation by Considering Quality of Services: New Data Envelopment Analysis Models
Many studies have been conducted to analyze efficiency of railways for different countries. However, these studies have mainly focused on quantitative aspects of railway transportation and quality has been neglected. In this paper three new data envelopment analysis (DEA) models are presented. The first model is solved for assessing quality of passenger railway services in 71 countries of t...
متن کاملLiterally better: Analyzing and improving the quality of literals
Quality is a complicated and multifarious topic in contemporary Linked Data research. The aspect of literal quality in particular has not yet been rigorously studied. Nevertheless, analyzing and improving the quality of literals is important since literals form a substantial (one in seven statements) and crucial part of the Semantic Web. Specifically, literals allow infinite value spaces to be ...
متن کاملLinking Semistructured Data on the Web
Many Web data sources and APIs make their data available in XML, JSON, or a domain-specific semi-structured format, with the goal of making the data easily accessible and usable by Web application developers. Although such data formats are more machine-processable than pure text documents, managing and analyzing such data in large scale is often nontrivial. This is mainly due to the lack of a w...
متن کاملTemporal Knowledge Extraction for Dataset Discovery
Linked data datasets are usually created with different data and metadata quality. This makes the exploration of these datasets a quite difficult task for the users. In this paper, we focus on improving discoverability of datasets based on their temporal characteristics. For this purpose, we identify the typology of temporal knowledge that can be observed inside data. We reuse existing temporal...
متن کامل